Discovering Concepts in Structural Data

نویسندگان

  • Diane J. Cook
  • Lawrence B. Holder
  • Gehad Galal
چکیده

The explosive growth of databases in scientiic, industrial, and commercial elds has not been accompanied by a similar growth in our ability to analyze and digest this data. The increasing amount and complexity of data creates an urgent need for automatic database analysis tools. This trend is evident in molecular biology data which continues to grow in both size and complexity. This research outlines a general approach to automatically discover repetitive and functional concepts in large structural databases. The Subdue system discovers substructures that compress the database and represent structural concepts in the data. By replacing previously-discovered substructures in the data, multiple passes of Subdue produce a hierarchical description of the structural regularities in the data. To increase the exibility of the system, we describe methods of incorporating domain-dependent information into the discovery process. Because discovery systems such as Subdue are very computationally expensive, we also explore ways of parallelizing the system to improve scalability.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Pathology of talent management in urban industries; case study: automotive industries

The purpose of this study is pathology of talent management in the automotive industry, so we identify Challenges and barriers, as well as success factors in this filed. This research is a kind of qualitative study that has been done by coding methodology of qualitative data. We used semi-structured interview to collect data. After collecting data and coding, data are divided into two groups of...

متن کامل

Discovering Structural Patterns in Telecommunications Data

With the increasing amount and complexity of data being collected, there is an urgent need to create automated techniques for mining the data. In particular, data being generated and stored by telecom companies overwhelms scientists' ability to manually discover patterns in the data. Because much of this data is structural in nature, or composed of parts and relations between the parts, linear ...

متن کامل

Discovering the Underlying Components Affecting the Usability of IoT in Iranian Libraries: A Theory Based on Context

Objective: The aim is to discover the underlying context components of IOT usability in Iranian libraries: A qualitative approach consistent with grounded theory. Method: This qualitative study was conducted based on grounded theory. Data were collected through semi-structured interviews with 13 faculty members of knowledge and information science based on purposeful and chain methods. Responsi...

متن کامل

Substructure Discovery Using Minimum Description Length and Background Knowledge

The ability to identify interesting and repetitive substructures is an essential component to discovering knowledge in structural data. We describe a new version of our Subdue substructure discovery system based on the minimum description length principle. The Subdue system discovers substructures that compress the original data and represent structural concepts in the data. By replacing previo...

متن کامل

Developing a grounded-based model of tranquility in contemporary apartments in Urmia City

Introduction: Stressful life and lack of tranquility in modern society, have been serious problems for human life. Environmental psychology has shown that physical and architectural environments play an important role in this, and since the home is one of the most important environments, they try to offer solutions. This study tries to identify the factors that play an effective role in creatin...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 1999